Relational Approach to XPath Query Optimization
Abstract
This thesis contributes to the Pathfinder project, which aims to build an XQuery compiler on top of a relational database system. Pathfinder is currently implemented on top of MonetDB, a main-memory database system. For optimization and portability, Pathfinder first compiles an XQuery expression into its own relational algebra and then translates the query into the query language of the underlying system (in this case MIL, the MonetDB Interpreter Language). This thesis focuses on the optimizability of that algebra. Pathfinder’s relational algebra is built on a set of basic relational operators (comparable to those of SQL), extended with the staircase join, an operator designed specifically for evaluating XPath axis steps. By formally specifying Pathfinder’s relational model and all operators of its algebra, this thesis shows how they can be exploited for optimization. The main focus is on the staircase join operator, which in its current implementation does not appear to provide enough rewriting flexibility. Therefore, a new staircase join definition is proposed: the symmetric staircase join.
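To make the idea of evaluating an XPath axis step relationally more concrete, the following is a minimal sketch, not Pathfinder's implementation and not the symmetric staircase join proposed in the thesis. It assumes the commonly used pre/post encoding, in which a node n is a descendant of a context node c exactly when pre(n) > pre(c) and post(n) < post(c). All names (`Node`, `encode`, `prune_descendant`, `descendants_staircase`) are hypothetical and chosen only for illustration.

```python
from typing import NamedTuple


class Node(NamedTuple):
    pre: int    # preorder rank (document order)
    post: int   # postorder rank
    tag: str


def encode(tree):
    """Encode a nested (tag, [children]) tree as rows sorted by pre rank."""
    rows = []
    post_counter = [0]

    def visit(node):
        tag, children = node
        pre = len(rows)          # next free preorder rank
        rows.append(None)        # reserve the slot; post rank is not known yet
        for child in children:
            visit(child)
        rows[pre] = Node(pre, post_counter[0], tag)
        post_counter[0] += 1

    visit(tree)
    return rows


def descendants_naive(doc, context):
    """Plain range join: test every (context, document) pair."""
    hits = {n for c in context for n in doc if n.pre > c.pre and n.post < c.post}
    return sorted(hits)


def prune_descendant(context):
    """Drop context nodes that are descendants of another context node."""
    kept = []
    for c in sorted(context):                 # document (pre) order
        if not kept or c.post > kept[-1].post:
            kept.append(c)
    return kept


def descendants_staircase(doc, context):
    """Simplified staircase-style evaluation of the descendant axis.

    Assumption: doc is sorted by pre rank, so doc[i].pre == i.
    """
    steps = prune_descendant(context)
    result = []
    for i, c in enumerate(steps):
        # Each pruned context node owns the partition up to the next step.
        end = steps[i + 1].pre if i + 1 < len(steps) else len(doc)
        for n in doc[c.pre + 1:end]:
            if n.post < c.post:
                result.append(n)
            else:
                break    # the rest of the partition follows c; skip it
    return result


# Example document: a(b(c, d), e(f))
doc = encode(("a", [("b", [("c", []), ("d", [])]), ("e", [("f", [])])]))
context = [doc[1], doc[2], doc[4]]            # nodes b, c, e
print([n.tag for n in descendants_staircase(doc, context)])
```

Under these assumptions the pruned context forms a "staircase" in the pre/post plane, so the document is scanned once, the result comes out in document order, and no duplicate elimination is needed. The naive variant produces the same node set but compares every pair.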
Related resources
Relational Approach to Logical Query Optimization of XPath
To be able to handle the ever growing volumes of XML documents, effective and efficient data management solutions are needed. Managing XML data in a relational DBMS has great potential. Recently, effective relational storage schemes and index structures have been proposed as well as special-purpose join operators to speed up querying of XML data using XPath/XQuery. In this paper, we address the...
VAMANA: A High Performance, Scalable and Cost Driven XPath Engine
Many applications are migrating to or beginning to make use of native XML data. We anticipate that queries will emerge that emphasize the structural semantics of XML query languages like XPath and XQuery. This brings a need for an efficient query engine and database management system tailored for XML data, similar to traditional relational engines. While mapping large XML documents into relational dat...
Processing XPath Expressions in Relational Databases
Research on processing XML documents has become very active in recent times. XML query languages are mostly based on XPath expressions, which are used to select parts of XML documents, so methods that define a semantics for XPath expressions are of special importance. In the paper we propose a relational semantics for XPath expressions. The semantics consists of four semantic functions defined over s...
XML Data Storage and Query Optimization in Relational Database by XPath Processing Model
XML is the de facto standard for data representation and exchange on the web. With the growth of XML data, traditional relational databases now support XML data processing across the board. Consistent storage and efficient querying of XML data are the chief problems in XML-enabled relational databases. This work presents mechanisms for storage and query optimization of XML data in relational ...
Pathfinder Meets DB2 Relational XQuery Optimization Techniques
We are taking the next big step towards the goal of a purely relational XQuery implementation. The Pathfinder XQuery compiler has been enhanced by a code generator that emits SQL. This code generator targets off-the-shelf relational database systems (e.g., DB2) and turns them into efficient and scalable XQuery processors. Our approach neither depends on modifications of the database kernel, nor...